Eliciting Meaningful Units from Speech

نویسندگان

  • Daniil Kocharov
  • Tatiana Kachkovskaia
  • Pavel A. Skrelin
چکیده

Elicitation of information structure from speech is a crucial step in automatic speech understanding. In terms of both production and perception, we consider intonational phrase to be the basic meaningful unit of information structure in speech. The current paper presents a method of detecting these units in speech by processing both the recorded speech and its textual representation. Using syntactic information, we split text into small groups of words closely connected with each other. Assuming that intonational phrases are built from these small groups, we use acoustic information to reveal their actual boundaries. The procedure was initially developed for processing Russian speech, and we have achieved the best published results for this language with F1 equal to 0.91. We assume that it may be adapted for other languages that have some amount of read speech resources, including under-resourced languages. For comparison we have evaluated it on English material (Boston University Radio Speech Corpus). Our results, F1 of 0.76, are comparable with the top systems designed for English.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Interactive Effect of Pragmatic Eliciting Tasks on EFL Pre-intermediate Learners' Speaking Proficiency

The present study investigated the effect of pragmatic eliciting tasks on EFL pre-intermediate learners speaking proficiency. Thus this study aimed at comparing the English language learners who practiced pragmatic eliciting tasks and the ones who used traditional speaking activities such as questions and answers, discussion, etc. In doing so, 40 learners out of 80 were selected through Oxford ...

متن کامل

Interactive Effect of Pragmatic Eliciting Tasks on EFL Pre-intermediate Learners' Speaking Proficiency

The present study investigated the effect of pragmatic eliciting tasks on EFL pre-intermediate learners speaking proficiency. Thus this study aimed at comparing the English language learners who practiced pragmatic eliciting tasks and the ones who used traditional speaking activities such as questions and answers, discussion, etc. In doing so, 40 learners out of 80 were selected through Oxford ...

متن کامل

Prosodic Cues as Basis for Restructuring

In most of the cases spontaneaously uttered units of speech (e.g. in face-to-face dialogues) contain performance phenomena like repairs, breaking offs, omissions and others that motivate a restructuring procedure which allows storage or further processing of the input. In our view, this restructuring procedure is based on the segmentation of the input into a set of functional (i.e. meaningful) ...

متن کامل

What makes a word : Learning base units in Japanese for speechrecognitionLaura

We describe an automatic process for learning word units in Japanese. Since the Japanese orthography has no spaces delimiting words, the rst step in building a Japanese speech recognition system is to deene the units that will be recognized. Our method applies a compound-nding algorithm, previously used to nd word sequences in English, to learning syllable sequences in Japanese. We report that ...

متن کامل

Speech recognition using syllable-like units

It is well known that speech is dynamic and that framebased systems lack the ability to realistically model the dynamics of speech. Segment-based systems o er the potential to integrate the dynamics of speech, at least within the phoneme boundaries, although it is di cult to obtain accurate phonemic segmentation in uent speech. In this paper we propose a new approach which uses syllable-like un...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017